Your Transformer is Secretly an EOT Solver
🧠LLM Inference
Accelerating AI inferencing with external KV Cache on Managed Lustre
cloud.google.com·4h
🏗️LLM Infrastructure
Text case changes the size of QR codes
johndcook.com·4h
📝Text Compression
The Smallest PNG
📝Text Compression
How We Saved 70% of CPU and 60% of Memory in Refinery’s Go Code, No Rust Required.
🔬Rust Profiling
From Lossy to Lossless Reasoning
🔤Tokenization
audiot909's landmark Japanese amapiano work "JAPANESE AMAPIANO THE ALBUM" gets an LP release
news.jp·10h
🎯Vector Quantization
Phase diagram map of ferroelectric properties unlocked with AI in seconds
phys.org·3h
🌏BGE Embeddings
Quanta Services, Inc. (PWR) Q3 2025 Earnings Call Transcript
seekingalpha.com·23h
🔍EXPLAIN ANALYZE
Run Multimodal Reasoning Agents with NVIDIA Nemotron on vLLM
blog.vllm.ai·20h
🏗️LLM Infrastructure
Made a simple fine-tuning tool
📋Markdown
Rearchitecting Vector Search: A Migration from MongoDB Atlas to Qdrant
pub.towardsai.net·13h
🎯Qdrant
Tencent/WeKnora
github.com·18h
🔎Meilisearch
Show HN: rstructor, Pydantic+instructor for Rust
🔄Serde
ClairS-TO: a deep-learning method for long-read tumor-only somatic small variant calling
nature.com·5h
🏗️LLM Infrastructure
Researchers advance cross-modality smart security with transformer model
techxplore.com·17h
🔗Hybrid Search
Vectorized Context-Aware Embeddings for GAT-Based Collaborative Filtering
arxiv.org·16h
🌏BGE Embeddings
MIT’s Survey On Accelerators and Processors for Inference, With Peak Performance And Power Comparisons
semiengineering.com·3h
🏗️LLM Infrastructure